Unifying Lexical Resources
Authors
Abstract
This paper describes our efforts to create a Unified Lexicon (UL) by extracting information from a variety of external resources, namely our XLE syntactic lexicon, WordNet, Cyc, and VerbNet. The UL is built in several steps: first, the information is extracted from the resources; then it is merged into lexical entries based on word stem, syntactic subcategorization frame, meaning concept, and WordNet class; finally, patch files are run over the UL to create a cleaner version. The patched version of the UL is used to extract semantics-to-KR mapping rules, including default rules for cases where the external resources have gaps. This paper focuses on unifying lexical resources for verbs.
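The extract, merge, patch, and rule-extraction steps described in the abstract can be pictured with a minimal sketch. Everything named here is an assumption made for illustration: the record fields, the frame labels, the concept name `GivingEvent`, the fallback `DefaultEvent`, and the helper functions are hypothetical and are not taken from the paper or from the actual XLE, WordNet, Cyc, or VerbNet data.

```python
from collections import defaultdict

def merge(records):
    """Group extracted records into entries keyed by the four merge criteria
    named in the abstract: stem, subcategorization frame, concept, WordNet class."""
    entries = defaultdict(lambda: {"sources": set()})
    for rec in records:
        key = (rec["stem"], rec["frame"], rec.get("concept"), rec.get("wn_class"))
        entries[key]["sources"].add(rec["source"])
    return dict(entries)

def drop_frame(bad_frame):
    """Build a hypothetical patch that removes every entry with the given frame."""
    def patch(entries):
        return {k: v for k, v in entries.items() if k[1] != bad_frame}
    return patch

def apply_patches(entries, patches):
    """Run each patch (a function from entries to entries) in order."""
    for patch in patches:
        entries = patch(entries)
    return entries

def mapping_rules(entries, default_concept="DefaultEvent"):
    """Map (stem, frame) to a concept; fall back to a default where the
    resources left a gap (concept is None)."""
    return {
        (stem, frame): concept if concept is not None else default_concept
        for (stem, frame, concept, _wn_class) in entries
    }

# Illustrative, made-up records standing in for the extraction step.
records = [
    {"source": "XLE", "stem": "give", "frame": "V-SUBJ-OBJ",
     "concept": "GivingEvent", "wn_class": "possession"},
    {"source": "VerbNet", "stem": "give", "frame": "V-SUBJ-OBJ",
     "concept": "GivingEvent", "wn_class": "possession"},
    {"source": "XLE", "stem": "blick", "frame": "V-SUBJ",
     "concept": None, "wn_class": None},
    {"source": "XLE", "stem": "give", "frame": "V-BAD",
     "concept": None, "wn_class": None},
]

ul = apply_patches(merge(records), [drop_frame("V-BAD")])
rules = mapping_rules(ul)
```

In this sketch the two `give` records collapse into one entry with two sources, the patch removes the `V-BAD` entry, and the unknown verb `blick` receives the default concept because no resource supplied one.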
Similar resources
Ensuring Semantic Interoperability on Lexical Resources
In this paper, we describe a unifying approach to tackle data heterogeneity issues for lexica and related resources. We present LEXUS, our software that implements the Lexical Markup Framework (LMF) to uniformly describe and manage lexica of different structures. LEXUS also makes use of a central Data Category Registry (DCR) to address terminological issues with regard to linguistic concepts as...
Direct and Indirect Linking of Lexical Objects for Evolving Lexical Linked Data
Servicization of language resources in a Web-based environment has opened up the potential for dynamically combined virtual lexical resources. Evolving lexical linked data could be realized, provided that links recovered or discovered among lexical resources are properly organized and maintained. This position paper examines a scenario, in which lexical semantic resources are cross-linguistically...
A Robust Approach to Aligning Heterogeneous Lexical Resources
Lexical resource alignment has been an active field of research over the last decade. However, prior methods for aligning lexical resources have been either specific to a particular pair of resources, or heavily dependent on the availability of hand-crafted alignment data for the pair of resources to be aligned. Here we present a unified approach that can be applied to an arbitrary pair of lexi...
The open lexical infrastructure of Språkbanken
We present our ongoing work on Karp, Språkbanken’s (the Swedish Language Bank) open lexical infrastructure, which has two main functions: (1) to support the work on creating, curating, and integrating our various lexical resources; and (2) to publish daily versions of the resources, making them searchable and downloadable. An important requirement on the lexical infrastructure is also that we m...
Standardizing lexical-semantic resources - Fleshing out the abstract standard LMF
This paper describes the application of the Lexical Markup Framework (LMF) for standardizing lexical-semantic resources in the context of NLP. More specifically, we highlight the question how lexical-semantic resources can be made semantically interoperable by means of LMF and ISOCat. The LMF model UBY-LMF, an instantiation of LMF specifically for NLP, serves as an example to illustrate the pat...